282 research outputs found

    Simulating gene silencing through intervention analysis

    Get PDF
    We propose a novel method for simulating the effects of gene silencing. Our approach combines relevant subject matter information provided by biological pathways with gene expression levels measured in regular conditions to predict the behavior of the system after one of the genes has been silenced. We achieve this by modeling gene silencing as an external intervention in a causal graphical model. To account for the uncertainty associated with the structure learning of the graphical model, we adopt a bootstrap approach. We illustrate our proposal on a Drosophila melanogaster gene silencing experiment

    MIDAW: a web tool for statistical analysis of microarray data

    Get PDF
    MIDAW (microarray data analysis web tool) is a web interface integrating a series of statistical algorithms that can be used for processing and interpretation of microarray data. MIDAW consists of two main sections: data normalization and data analysis. In the normalization phase the simultaneous processing of several experiments with background correction, global and local mean and variance normalization are carried out. The data analysis section allows graphical display of expression data for descriptive purposes, estimation of missing values, reduction of data dimension, discriminant analysis and identification of marker genes. The statistical results are organized in dynamic web pages and tables, where the transcript/gene probes contained in a specific microarray platform can be linked (according to user choice) to external databases (GenBank, Entrez Gene, UniGene). Tutorial files help the user throughout the statistical analysis to ensure that the forms are filled out correctly. MIDAW has been developed using Perl and PHP and it uses R/Bioconductor languages and routines. MIDAW is GPL licensed and freely accessible at . Perl and PHP source codes are available from the authors upon request

    Beneficial Bacteria Isolated from Grapevine Inner Tissues Shape Arabidopsis thaliana Roots

    Get PDF
    We investigated the potential plant growth-promoting traits of 377 culturable endophytic bacteria, isolated from Vitis vinifera cv. Glera, as good biofertilizer candidates in vineyard management. Endophyte ability in promoting plant growth was assessed in vitro by testing ammonia production, phosphate solubilization, indole-3-acetic acid (IAA) and IAA-like molecule biosynthesis, siderophore and lytic enzyme secretion. Many of the isolates were able to mobilize phosphate (33%), release ammonium (39%), secrete siderophores (38%) and a limited part of them synthetized IAA and IAA-like molecules (5%). Effects of each of the 377 grapevine beneficial bacteria on Arabidopsis thaliana root development were also analyzed to discern plant growth-promoting abilities (PGP) of the different strains, that often exhibit more than one PGP trait. A supervised model-based clustering analysis highlighted six different classes of PGP effects on root architecture. A. thaliana DR5::GUS plantlets, inoculated with IAA-producing endophytes, resulted in altered root growth and enhanced auxin response. Overall, the results indicate that the Glera PGP endospheric culturable microbiome could contribute, by structural root changes, to obtain water and nutrients increasing plant adaptation and survival. From the complete cultivable collection, twelve promising endophytes mainly belonging to the Bacillus but also to Micrococcus and Pantoea genera, were selected for further investigations in the grapevine host plants towards future application in sustainable management of vineyards

    Assessment of statistical methods from single cell, bulk RNA-seq, and metagenomics applied to microbiome data

    Get PDF
    BackgroundThe correct identification of differentially abundant microbial taxa between experimental conditions is a methodological and computational challenge. Recent work has produced methods to deal with the high sparsity and compositionality characteristic of microbiome data, but independent benchmarks comparing these to alternatives developed for RNA-seq data analysis are lacking.ResultsWe compare methods developed for single-cell and bulk RNA-seq, and specifically for microbiome data, in terms of suitability of distributional assumptions, ability to control false discoveries, concordance, power, and correct identification of differentially abundant genera. We benchmark these methods using 100 manually curated datasets from 16S and whole metagenome shotgun sequencing.ConclusionsThe multivariate and compositional methods developed specifically for microbiome analysis did not outperform univariate methods developed for differential expression analysis of RNA-seq data. We recommend a careful exploratory data analysis prior to application of any inferential model and we present a framework to help scientists make an informed choice of analysis methods in a dataset-specific manner

    A novel prognostic score to assess the risk of progression in relapsing-remitting multiple sclerosis patients

    Get PDF
    BACKGROUND: At patient-level, the prognostic value of several features that are known to be associated with an increased risk of converting from relapsing remitting (RR) to secondary phase (SP) multiple sclerosis (MS), remain limited.METHODS: Among 262 RRMS patients followed up for ten years, we assessed the probability of developing the SP course based on clinical and conventional and non-conventional magnetic resonance imaging (MRI) parameters at diagnosis and after two years. We used a machine learning method, the Random Survival Forests, to identify, according to their minimal depth (MD), the most predictive factors associated with the risk of SP conversion, which were then combined to compute the Secondary Progressive Risk Score (SP-RiSc).RESULTS: During the observation period, 69 (26%) patients converted to SPMS. The number of cortical lesions (MD=2.47) and age (MD=3.30) at diagnosis, the global cortical thinning (MD = 1.65), the cerebellar cortical volume loss (MD = 2.15) and the cortical lesion load increase (MD=3.15) over the first two years, exerted the greatest predictive effect. Three patients' risk-groups were identified; in the high-risk group, 85% (46 out of 55) of patients entered the SP phase in 7 median years. The SP-RiSc optimal cut-off estimated was 17.7 showing specificity and sensitivity of 87% and 92% respectively, and overall accuracy of 88%.CONCLUSIONS: The SP-RiSc yielded a high performance in identifying MS patients with high probability to develop SPMS, which can help improve management strategies. These findings are the premise of further larger prospective studies to assess its use in clinical settings

    Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In the last decades, microarray technology has spread, leading to a dramatic increase of publicly available datasets. The first statistical tools developed were focused on the identification of significant differentially expressed genes. Later, researchers moved toward the systematic integration of gene expression profiles with additional biological information, such as chromosomal location, ontological annotations or sequence features. The analysis of gene expression linked to physical location of genes on chromosomes allows the identification of transcriptionally imbalanced regions, while, Gene Set Analysis focuses on the detection of coordinated changes in transcriptional levels among sets of biologically related genes.</p> <p>In this field, meta-analysis offers the possibility to compare different studies, addressing the same biological question to fully exploit public gene expression datasets.</p> <p>Results</p> <p>We describe STEPath, a method that starts from gene expression profiles and integrates the analysis of imbalanced region as an <it>a priori </it>step before performing gene set analysis. The application of STEPath in individual studies produced gene set scores weighted by chromosomal activation. As a final step, we propose a way to compare these scores across different studies (meta-analysis) on related biological issues. One complication with meta-analysis is batch effects, which occur because molecular measurements are affected by laboratory conditions, reagent lots and personnel differences. Major problems occur when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. We evaluated the power of combining chromosome mapping and gene set enrichment analysis, performing the analysis on a dataset of leukaemia (example of individual study) and on a dataset of skeletal muscle diseases (meta-analysis approach).</p> <p>In leukaemia, we identified the Hox gene set, a gene set closely related to the pathology that other algorithms of gene set analysis do not identify, while the meta-analysis approach on muscular disease discriminates between related pathologies and correlates similar ones from different studies.</p> <p>Conclusions</p> <p>STEPath is a new method that integrates gene expression profiles, genomic co-expressed regions and the information about the biological function of genes. The usage of the STEPath-computed gene set scores overcomes batch effects in the meta-analysis approaches allowing the direct comparison of different pathologies and different studies on a gene set activation level.</p

    Meta-analysis of expression signatures of muscle atrophy: gene interaction networks in early and late stages

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Skeletal muscle mass can be markedly reduced through a process called atrophy, as a consequence of many diseases or critical physiological and environmental situations. Atrophy is characterised by loss of contractile proteins and reduction of fiber volume. Although in the last decade the molecular aspects underlying muscle atrophy have received increased attention, the fine mechanisms controlling muscle degeneration are still incomplete. In this study we applied meta-analysis on gene expression signatures pertaining to different types of muscle atrophy for the identification of novel key regulatory signals implicated in these degenerative processes.</p> <p>Results</p> <p>We found a general down-regulation of genes involved in energy production and carbohydrate metabolism and up-regulation of genes for protein degradation and catabolism. Six functional pathways occupy central positions in the molecular network obtained by the integration of atrophy transcriptome and molecular interaction data. They are TGF-β pathway, apoptosis, membrane trafficking/cytoskeleton organization, NFKB pathways, inflammation and reorganization of the extracellular matrix. Protein degradation pathway is evident only in the network specific for muscle short-term response to atrophy. TGF-β pathway plays a central role with proteins SMAD3/4, MYC, MAX and CDKN1A in the general network, and JUN, MYC, GNB2L1/RACK1 in the short-term muscle response network.</p> <p>Conclusion</p> <p>Our study offers a general overview of the molecular pathways and cellular processes regulating the establishment and maintenance of atrophic state in skeletal muscle, showing also how the different pathways are interconnected. This analysis identifies novel key factors that could be further investigated as potential targets for the development of therapeutic treatments. We suggest that the transcription factors SMAD3/4, GNB2L1/RACK1, MYC, MAX and JUN, whose functions have been extensively studied in tumours but only marginally in muscle, appear instead to play important roles in regulating muscle response to atrophy.</p

    A global gene evolution analysis on Vibrionaceae family using phylogenetic profile

    Get PDF
    <p>Abstract</p> <p>Background</p> <p><it>Vibrionaceae </it>represent a significant portion of the cultivable heterotrophic sea bacteria; they strongly affect nutrient cycling and some species are devastating pathogens.</p> <p>In this work we propose an improved phylogenetic profile analysis on 14 <it>Vibrionaceae </it>genomes, to study the evolution of this family on the basis of gene content.</p> <p>The phylogenetic profile is based on the observation that genes involved in the same process (e.g. metabolic pathway or structural complex) tend to be concurrently present or absent within different genomes. This allows the prediction of hypothetical functions on the basis of a shared phylogenetic profiles. Moreover this approach is useful to identify putative laterally transferred elements on the basis of their presence on distantly phylogenetically related bacteria.</p> <p>Results</p> <p><it>Vibrionaceae </it>ORFs were aligned against all the available bacterial proteomes. Phylogenetic profile is defined as an array of distances, based on aminoacid substitution matrixes, from single genes to all their orthologues. Final phylogenetic profiles, derived from non-redundant list of all ORFs, was defined as the median of all the profiles belonging to the cluster. The resulting phylogenetic profiles matrix contains gene clusters on the rows and organisms on the columns.</p> <p>Cluster analysis identified groups of "core genes" with a widespread high similarity across all the organisms and several clusters that contain genes homologous only to a limited set of organisms. On each of these clusters, COG class enrichment has been calculated. The analysis reveals that clusters of core genes have the highest number of enriched classes, while the others are enriched just for few of them like DNA replication, recombination and repair.</p> <p>Conclusion</p> <p>We found that mobile elements have heterogeneous profiles not only across the entire set of organisms, but also within <it>Vibrionaceae</it>; this confirms their great influence on bacteria evolution even inside the same family. Furthermore, several hypothetical proteins highly correlate with mobile elements profiles suggesting a possible horizontal transfer mechanism for the evolution of these genes. Finally, we suggested the putative role of some ORFs having an unknown function on the basis of their phylogenetic profile similarity to well characterized genes.</p

    BrewerIX enables allelic expression analysis of imprinted and X-linked genes from bulk and single-cell transcriptomes

    Get PDF
    Genomic imprinting and X chromosome inactivation (XCI) are two prototypical epigenetic mechanisms whereby a set of genes is expressed mono-allelically in order to fine-tune their expression levels. Defects in genomic imprinting have been observed in several neurodevelopmental disorders, in a wide range of tumours and in induced pluripotent stem cells (iPSCs). Single Nucleotide Variants (SNVs) are readily detectable by RNA-sequencing allowing the determination of whether imprinted or X-linked genes are aberrantly expressed from both alleles, although standardised analysis methods are still missing. We have developed a tool, named BrewerIX, that provides comprehensive information about the allelic expression of a large, manually-curated set of imprinted and X-linked genes. BrewerIX does not require programming skills, runs on a standard personal computer, and can analyze both bulk and single-cell transcriptomes of human and mouse cells directly from raw sequencing data. BrewerIX confirmed previous observations regarding the bi-allelic expression of some imprinted genes in naive pluripotent cells and extended them to preimplantation embryos. BrewerIX also identified misregulated imprinted genes in breast cancer cells and in human organoids and identified genes escaping XCI in human somatic cells. We believe BrewerIX will be useful for the study of genomic imprinting and XCI during development and reprogramming, and for detecting aberrations in cancer, iPSCs and organoids. Due to its ease of use to non-computational biologists, its implementation could become standard practice during sample assessment, thus raising the robustness and reproducibility of future studies
    corecore